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Abstract 

Fault-tolerance  is  an  important  issue  in  network  design  because  sensor  networks  must 
function  in  a  dynamic,  uncertain  world.  In  this  paper,  we  propose  a  functional  characterization 
of  the  fault-tolerant  integration  of  abstract  interval  estimates.  This  model  provides  a  test  bed 
for  a  general  framework  which  we  hope  to  develop  to  address  the  general  problem  of  fault- 
tolerant  integration  of  abstract  sensor  estimates.  We  further  propose  a  scheme  for  narrowing 
the  width  of  the  sensor  output  in  a  specific  failure  model  and  give  it  a  functional 
representation. 

The  main  distinguishing  feature  of  our  model  over  the  original  Marzullo's  model  is  in 
reducing  the  width  of  the  output  interval  estimate  significantly  in  most  cases  where  the  number 
of  sensors  involved  is  large. 
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In  recent  years,  the  increasing  sophistication  of  surveillance  systems 
and  tracking  mechanisms  has  generated  a  great  deal  of  interest  in  the 
development  of  new  computational  structures  and  strategies  for  detecting 
and  tracking  multiple  targets,  using  data  from  many  sensors. 

The  design  of  spatially  distributed  target-detection-and-tracking  systems 
involves  the  integration  of  solutions  obtained  by  solving  subproblems  in 
data-association,  hypothesis-testing,  data-fusion,  etc.  [13].  This  must  include 
the  cooperative  solution  of  problems  by  a  decentralized  and  loosely  coupled 
collection  of  processors,  each  of  which  integrates  information  received  from  a 
cluster  of  spatially  distributed  sensors  into  a  manageable  and  reliable  output 
for  further  integration  at  a  higher  level.  Integration  of  information  at  the 
sensor  level  requires  techniques  to  be  developed  to  abstractly  represent  and 
integrate  sensor  information.  Further  these  techniques  have  to  be  robust  in 
the  sense  that  even  if  some  of  the  sensors  are  faulty,  the  integrated  output 
should  still  be  reliable.  For  details  on  multi  sensor  integration  and  fusion  in 
intelligent  systems,  see  [  5,6,7,8,9,10,11,14]. 

The  aim  of  this  paper  is  to  present  a  fault-tolerant  computational 
model  for  sensor  integration  in  Distributed  Sensor  Networks. 

A  Distributed  Sensor  Network  (DSN)  consists  of  spatially  distributed 
sensors  which  detect  and  measure  a  certain  phenomenon  via  its  changing 
parameters.  These  readings  are  sent  at  regular  intervals  of  time  to  processing 
units  which  integrate  the  readings  from  clusters  of  sensors  and  give  outputs 
whose  nature  is  much  the  same  as  the  inputs  of  the  sensors.  Outputs  from 
processors  representing  clusters  of  sensors  are  later  integrated  to  get  a 
complete  picture  of  the  spatially  distributed  phenomenon.  However,  before 
integration  is  performed  at  the  processor  level,  it  is  necessary  to  have  reliable 
estimates  at  each  processor.  Each  sensor  in  a  cluster  measures  the  same  set  of 
parameters.  It  is  possible  that  .some  of  these  sensors  are  faulty.  Hence  it  is 
desirable  to  make  use  of  the  redundancy  of  the  readings  in  the  cluster  to 

obtain  a  correct  estimate  of  the  parameter  being  read.  In  short,  a  fault-tolerant 
technique  of  sensor  integration  to  obtain  the  correct  estimate  is  sought. 


1.1  Scope  of  TJtis  Paper 


This  paper  has  two  objectives:  The  first  objective  is  to  propose  a 
functional  characterization  of  fault-tolerant  integration  of  abstract  interval 
estimates  considered  by  Marzullo  [4].  The  second  objective  is  to  propose  a 
modified  computational  scheme  of  integration  carried  out  by  Marzullo  [4]  in 
the  case  when  the  number  of  sensors  is  large,  wherein  it  is  possible  to 
improve  the  accuracy  of  the  integrated  output. 

The  main  distinguishing  feature  of  our  model  over  the  original 
Marzullo's  model  is  in  reducing  the  width  of  the  output  interval  estimate 
significantly  in  most  cases  where  the  number  of  sensors  involved  is  large. 

Elsewhere  we  intend  to  generalize  Marzullo's  approach  to  the  cases 
when  the  sensor  outputs  are  subsets  of  an  abstract  parameter  space.  The 
functional  characterization  of  the  fault-tolerant  integration  of  abstract 
interval  estimates  described  in  this  paper  hints  at  an  abstract  framework.  We 
hope  to  develop  for  addressing  the  general  problem  of  fault-tolerant 
integration  of  sensor  outputs. 

1.2  Organization  of  the  Paper 

In  section  2,  we  describe  Marzullo’s  work  on  sensor  integration  and 
other  related  work.  Our  abstract  model  functional  characterization  is  detailed 
in  section  3  and  is  an  extension  of  the  model  proposed  by  Marzullo.  In  section 
4,  we  motivate  the  need  for  a  new  failure  model  and  present  the  information 
integration  algorithm  with  a  specific  example.  Finally,  we  close  the  paper 
with  concluding  remarks  and  future  directions  this  reach  would  take. 

2.0  RELATED  WORK 


Marzullo  [4]  considers  the  case  of  a  processor  receiving  input  from 
several  sensors  whose  outputs  are  connected  intervals.  He  gives  a  fault- 
tolerant  integration  algorithm  which  takes  as  input  the  intervals  representing 
the  sensors  and  gives  as  output  of  the  processor  a  connected  interval 
representing  the  sensor  values.  More  precisely:  Let  there  be  n  sensors,  each  of 
which  yields  an  interval  as  its  output,  there  sensors  measure  a  certain 
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physical  value  and  their  intervals  contain  the  physical  value  unless  they 
happen  to  be  faulty  sensors. 

Thus,  a  correct  sensor  is  one  which  contains  the  actual  physical  value 
in  its  interval.  Any  two  correct  sensors  must  overlap  since  they  both  contain 
the  physical  value  being  measured. 

Marzullo  considers  the  case  when  almost  f  sensors  are  faulty  and  gives 
an  algorithm  which  yields  a  connected  interval  as  the  output  of  the  processor, 
containing  the  physical  value. 

If  at  most  f  of  the  n  sensors  are  faulty,  then  it  follows  that  at  least  n-f 
sensors  are  correct.  Marzullo  considers  all  possible  nonempty 
(n-f)-intersections  of  the  n-sensors.  A  sensor  which  does  not  belong  to  any  of 
the  (n-f)-cliques  is  faulty  since  a  correct  sensor  overlaps  with  at  least  (n-f-1) 
other  correct  sensors.  One  and  only  one  of  the  (n-O-intersections  contains  the 
physical  value.  Since  it  is  not  possible  to  decide  which  intersection  has  the 
physical  value  (which  is  as  yet  unknown  to  us)  and  since  the  processor  output 
is  required  to  be  a  connected  interval,  the  smallest  connected  interval 
containing  all  the  (n-f)-intersections  is  taken  to  be  the  output  of  the  processor. 
It  is  easy  to  see  that  it  contains  the  actual  physical  value.  The  wider  this 
interval  is,  the  lesser  the  accuracy  of  the  processor  output.  Marzullo  proves 
the  existence  of  bounds  for  the  width  of  this  interval  in  terms  of  f. 

The  example  described  below  provide  a  description  of  integration 
process. 
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Figure  1.  Integration  Interval  estimates 

( aT<a3<  a2<b3<bi<b2<a6<a5<a4<b6<b5<b4) 
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In  the  above  figure,  we  have  the  intervals  Ij  =  [aj,  bj]  1  <  j  <  6. 
Overlapping  one  another  according  to  the  strict  chain  of  inequalities  given 
above.  Here  /  =  3  and  n  =  6.  So,  taking  all  possible  (n  -  0  intersections  gives  us 
the  intervals  [a2,  b3]  and  [a4,  b6].  Then  enclosing  these  intervals  in  the  smallest 
possible  connected  interval,  we  have  the  integrated  output  interval  Ip  given 
by  Ip  =  [  32,  b6]. 

In  the  statistical  literature,  the  popular  methods  for  the  combining  the 
point  estimates  of  several  (possibly  faulty)  sensors  into  a  single  point  estimate 
come  under  the  designation  of  robust  estimation.  This  family  includes  the 
median,  Huber  function  based  methods,  least  median  square  methods,  etc. 
[1,2].  Some  of  these  methods  have  been  applied  to  the  sensor  fusion  problem 
in  [3]. 


However,  in  methods  like  a  median,  there  is  no  easy  way  of  including 
the  additional  information  such  as  "at  most  f  out  of  n  sensors  are  faulty". 
Moreover,  all  of  these  methods  are  geared  to  the  generation  of  point 
estimates  whereas  our  paper  concentrates  on  the  interval  estimates. 

The  main  thrust  in  our  paper  is  in  the  derivation  of  computational 
schemes  for  narrowing  the  width  of  the  processor  output  in  a  specific  failure 
model  and  give  it  a  functional  representation. 


2J.  Interval  Representation  of  Sensor  Readings 

A  sensor  reads  a  physical  variable  and  gives  a  number  as  its  output. 
However  a  sensor  is  prone  to  inaccuracies  and  there  may  be  some  uncertainty 
in  the  value  of  its  output.  The  simplest  modeling  of  this  is  achieved  by 
looking  upon  sensor  outputs  as  connected  intervals  on  the  real  line  rather 
than  as  points.  The  actual  value  representing  the  physical  variable  being 
measured  is  taken  to  be  contained  in  the  interval  associated  with  the  sensor  if 
the  sensor  is  not  faulty.  No  assumptions  are  made  about  the  width  of  these 
intervals  or  their  position  on  the  real  line.  Thus  each  sensor  value  is 
represented  by  an  interval  estimate.  We  make  this  notion  precise  in  the 
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following  definitions  that  are  useful  in  characterizing  one  model  of  sensor 
integration. 

Definition  1  :  An  abstract  sensor  is  a  sensor  which  reads  a  physical 
parameter  and  gives  out  an  abstract  interval  estimate  Ij,  which 
is  a  bounded  and  connected  subset  of  the  real  line  R. 

Definition  2  :  A  correct  sensor  is  an  abstract  sensor  whose  interval  estimate 
contains  the  actual  value  of  the  parameter  being  measured.  If 
the  interval  estimate  does  not  contain  the  actual  value  of  the 
parameters  being  measured,  it  is  called  a  faulty  sensor. 

Definition  3  :  Let  sensors  si,...,Sn  feed  into  a  processor  P.  Let  the  abstract 
interval  estimate  of  Sjbe  Ij  1<  j  <  n,  where  Ij  is  the  closed 
interval  [aj,bj]  with  end-points  ajand  bj  .  Define  the 
characteristic  function  Xj  of  the  sensor  Sj  1<  j  <  n  as  follows  : 

Xj  •  {o»n 

5CjW  =  {oV«r;  Vl<j£n. 


1  - 

Xi 

0 

C 

li  1 

- 

- 

Figure  2.  Representation  of  intervals 


The  ne.xt  section  addresses  the  question  of  how  the  abstract  sensors  or  abstract 
estimates  are  combined  to  yield  new  abstract  estimates. 


3.0  THE  PROBLEM  OF  FAULT-TOLERANT  SENSOR 
INTEGRATION 


Fault  tolerance  is  a  crucial  requirement  to  be  satisfied  by  a  distributed 
sensor  network  for  it  to  be  reliable  in  a  situation  where  one  or  more  of  its 
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sensors  fails  and  yield  faulty  readings.  Fault  tolerance  in  a  distributed  sensor 
network  thus  implies  that  the  values  of  the  parameters  measured  by  the 
network  are  reliably  reflected  in  its  output  even  though  some  of  the  sensors 
may  be  faulty.  We  propose  to  introduce  fault  tolerance  into  the  distributed 
sensor  network  by  providing  a  method  of  integration  of  sensor  values  that 
yields  a  reliable  output  which  reflects  the  correct  values  of  the  parameters 
being  measured  with  high  fidelity. 

Recall  that  our  definition  of  a  faulty  sensor  is  one  where  abstract  sensor 
estimate  (interval  estimate)  does  not  count  in  the  actual  physical  value  being 
measured. 

The  problem  of  fault-tolerant  sensor  integration  is  the  integration  of 
the  Ij  (1<  j  <n),  to  obtain  an  abstract  interval  estimate  Ip  =  [ap  ,bp]  which  is  a 
’reliable’  and  'fairly  accurate'  estimate  of  the  region  in  which  the  physical 
sensor  value  lies.This  integration  should  be  fault-tolerant  in  that  its  reliability 
should  not  be  severely  affected  by  some  of  the  sensors  being  faulty.  In  other 
words,  we  seek  to  obtain  a  functional  relationship  between  the  characteristic 
function  Xp  of  Ip  =  [ap,bp]  and  the  Xj  1  <  j  <  n:  Xp(x)  =  f(Xi(x), ...,  Xn(x) )  such  that 
Xp'HD  is  a  fault-tolerant  interval  estimate  of  the  physical  value  being 
measured. 

We  now  go  about  obtaining  a  functional  representation  of  the 
integrated  output  estimate  under  the  integration  scheme  of  Marzullo.  In 
order  to  do  this  we  need  to  introduce  a  few  relevant  operations  and  functions. 
The  following  definition  provide  such  operations  for  our  integration 
problem: 


Definition  1:  If  f(x)  is  a  real-valued  function,  define  ||f||  =sup{|f(x)|  |  x  e  R} 
(norm  of  f)  .  Here  sup  stands  for  the  supremum.  That  is,  ||f||  is  the 
smallest  real  number  a  such  that  f(x)  <  a  V  x  e  R. 

Definition  2:  If  f(x)  is  a  real  valued  function  define  Supp  f  =  {x  I  f(x)  0) 
(support  of  f ). 
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n 

Definition  3:  Let  0(x)  =  'overlap  function'.  For  each  x  e  R 

i=^ 

,  0(x)  gives  the  number  of  intervals  in  which  x  lies  or  the  number 
of  intervals  overlapping  at  the  point  x. 

Remark  1:  The  integer  ||XjO(x)||  gives  the  maximum  number  of  intervals 

in  which  any  x  e  fj. 

Proof :  Indeed,  0(x)  is  the  number  of  intervals  overlapping  at  the  point  x. 
Multiplying  by  Xi(x)  restricts  )0(x)  to  the  interval  Ij 

llXjOII  therefore  gives  the  maximum  number  of  intervals 
overlapping  at  any  x  e  fj. 

Then  L  belongs  to  llXjO||-clique.  Where,  by  a  n-clique  we  mean  a 
group  of  n  intervals  having  a  common  intersection.  ■ 

If  x,(x)  and  Xi(x)  are  characteristic  functions  of  intervals  and  Ij  then  the 
characteristic  function  of  the  interval  linlj  denoted  %unij(x)  is  given  by  the 
product  Xi(x)  Xi(x). 

If  Xi(x)  is  the  characteristic  function  of  Ij,  then  the  characteristic 
function  of  I^^  (the  complement  set  of  li)  is  given  by  (1-  Xi(x)). 

Thus  if  I],  ...,In  are  intervals  with  characteristic  functions  Xn  -vXn  /  then 
the  characteristic  function  of  their  union  %^^Ii  (x)  is  given  by 

X^iiW  =  1  -  n(l- Xi(x)). 

i=l 

Marzullo[4]  assumes  that  there  are  at  most  f  faulty  sensors  among  n 
sensors,  and  considers  the  intersections  of  (n-0  or  more  sensors  as  the  regions 
in  which  the  correct  physical  value  lies.  An  interval  which  does  not 
participate  in  any  (n-f)-intersection  is  taken  to  be  the  estimate  of  a  faulty 
sensor.  The  output  is  the  smallest  interval  which  contains  all  (n-O  or  more 
intersections. 
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3.1.  Computational  Characterization 

We  now  obtain  a  functional  characterization  of  the  (n  -  f)  intersections 
and  this  integrated  output  estimate  as  described  in  the  previous  section.  More 
precisely,  we  give  an  explicit  expression  for  this  characteristic  function  of  the 
(n  -  f)  intersection  in  terms  of  the  characteristic  function  of  the  intervals 
corresponding  to  the  sensor  estimates. 

Remark  2:  If  at  most  f  sensors  are  faulty,  then  we  need  to  consider  only  those 
h's  for  which  Iboll  >  (n-f).  Thus  the  characteristic  function  of 
the  set  of  all  points  lying  in  (n-f)  or  more  intersections  of  the 
intervals  Ij  (1  <  j  <  n)  is  given  by  : 

n 

S(x)=  1  -11(1  -X[„-f.»o)(llXiOll)Xj(x)) 
j=i 

Where  X[n-f.=o)is  the  characteristic  function  of  the  interval  [n-f,“")- 

Proof :  Indeed  X[n-f,~)(llXi  OH)  =  1  iff  HXi  OH  —  “'^'f 

i.e.  iff  Ij  has  at  least  n-f-1  other  intervals  intersecting  it  at  some  point  in 
it. 

So  X[n-f.~)(HXi  Oll)Xj  (x)  =  Xj  (x)  if  and  only  if  Ij  is  as  described  above.  So 
that  S(x)  is  the  support  of  the  characteristic  function  of  all  those  points 
which  belong  to  (n-f)  or  more  intervals'  intersection.  ■ 

Now  the  correct  physical  value  belongs  to  Supp(S(x)),  i.e;  to  one  of  the 
intervals  constituting  it.  Marzullo  proposes  the  smallest  connected  interval 
containing  Supp(S(x))  for  the  integrated  output. 

More  precisely  the  output  interval  estimate  Ip  is  given  by  ; 

Ip  =  [  Min  {x  1  S(x)  =  1 },  Max  (x  I  S(x)  =  1 }] 

The  above  integration  technique  does  indeed  give  a  connected  Interval 
within  which  the  actual  physical  values  lies.  It  however  includes  points 
which  do  not  belong  to  the  intervals  constituting  Supp(S(x)). 
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Furthermore,  if  the  intervals  constituting  Supp(S(x))  are  widely  scattered  over 
k 

then  there  are  wide  gaps  between  these  intervals  which  do  not 
i=l 

contain  the  physical  value  and  yet  are  included  in  the  output  estimate. 
This  results  in  the  width  of  the  smallest  interval  containing  Supp(S(x)) 

k 

being  comparable  to  the  smallest  interval  containing  (see  Fig.  1)  and 

i=l 

this  is  of  little  value  from  the  point  of  accuracy.  We  need  to  evolve  a 
criterion  by  which  we  can  pick  only  a  few  of  the  intervals  constituting 
Supp(S(x))  with  a  fair  amount  of  certainty  that  they  enclose  the  correct 
physical  value.  The  rest  of  our  analysis  is  in  this  direction. 


4.0.  A  NEW  FAILURE  MODEL  WITH  SHARPER  OUTPUT 
INTERVAL  ESTIMATES 

We  propose  a  failure  model  in  which  it  is  possible  to  choose  in  most 
cases  a  subset  of  Supp(S(x))  as  the  region  of  correct  sensor  value  instead  of  the 
whole  of  Ip  as  defined  above.  A  sensor  may  fail  wildly,  in  which  case  there  is 
no  correlation  between  the  actual  physical  value  being  measured  and  the 
interval  estimate  of  the  faulty  sensor.  On  the  other  hand,  a  sensor  may  fail 
tamely,  in  which  case  although  the  faulty  sensor's  interval  estimate  does  not 
contain  the  actual  physical  value,  the  interval  estimate  lies  significantly  close 
to  the  value  in  a  certain  sense.  For  example,  mechanical  vibrations  may 
induce  a  tame  fault  in  dials  and  meters  by  shifting  the  needless  fluctuations  to 
a  region  which  does  not  contain  the  correct  value  but  lies  close  to  it.  Since  we 
do  not  know  the  actual  physical  value,  we  cannot  detect  the  tameness  of  a 
fault  directly.  However  tamely  faulty  sensor  estimates  tend  to  overlap  with 
correct  sensor  estimates  because  of  their  proximity  to  the  actual  physical 
value.  We  consider  the  case  when  the  number  of  sensors  to  be  integrated  is 
very  large  and  assume  that  most  of  the  faulty  sensors  are  tamely  faulty.  In  this 
case,  we  observe  that  correct  sensors  have  a  relatively  larger  number  of 
intervals  overlapping  with  them  as  compared  to  undetected  faulty  sensors 
participating  in  the  (n-f)-intersections,  since  tamely  faulty  sensors  overlap 
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with  correct  sensor  estimates. Thus  the  number  of  sensor  estimates 
overlapping  with  a  given  sensor  estimate  is  a  good  index  of  its  correctness. 
We  make  use  of  this  observation  to  narrow  our  output  interval  estimate, 
namely  Ip. 

k 

Let  Supp(S(x))=  where  Li=  [ai,pi]  with  Pi  <  Oj+i  V  1  <  i  <  k-1.  Wf* 
i=i 

now  perform  an  evaluation  of  the  Li’s  in  order  to  attach  a  weight  to  each  of 
them  and  choose  those  Li's  with  maximum  weight  to  be  the  intervals  which 
have  a  high  likelihood  of  containing  the  correct  physical  value.  We  then 
again  enclose  these  Li's  of  maximum  weight  by  the  smallest  possible  interval 
and  take  it  to  be  the  output  estimate. 

Remark  3:  Let  Xli(x)  be  the  characteristic  function  of  Li.  Then  we  can  define 

n 

the  popularity  of  the  sensor  to  be  the  number  Pj  =  El|XuX,|l- 

k=l 

1  .  Pj  gives  the  number  of  sensor  intervals  overlapping  with 
the  sensor  interval. 

Proof:  Indeed  IMI 

n 

Thus  ^r||X  kXj|l-  1  gives  the  number  of  intervals  (apart  from  Ij) 

k=I 

intersecting  with  Ij.  ■ 


4.1  Narrowing  of  the  Output  Interval  Estimate  Width 

The  L,  s  are  (n  -  f)  intersections.  The  reliability  ri  gives  a  measure  of  the 
clustering  of  sensors  around  the  Lj.  Our  assumption  that  most  of  the  faulty 
sensors  are  tamely  faulty  implies  that  the  Li  with  maximum  clustering 
around  it  is  most  likely  to  contain  the  correct  physical  value  measured  by  the 
sensors.  The  sum  of  the  popularities  of  the  sensors  involved  in  the  formation 
of  Lj  is  a  good  index  of  the  clustering  of  sensors  about  Lj. 
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Hence,  we  would  like  to  take  the  sum  of  the  popularities  of  all  sensors 
involved  in  the  formation  of  Li,  and  call  it  the  reliability  rj  of  Li. 

n 

Consider  the  set  function  W(Li)=  SIIXL.%jllPj  ,1  <  i  <  k  defined  on 

j=i 

each  L,.  W(Li)  gives  the  sum  of  the  number  of  intervals  overlapping  with 
each  sensor  estimate  in  the  (n-f)-or-more  clique  Li. 

i.e.,  r.  =  W(L0  V  1  <  i  <  k. 

Let  r  =  max  {r.  1  1  <  i  <  k),  m  =  min  {i  1  r,  =  r}  and  M  =  max  {i  1  r;  =  r}. 
Consider  the  interval  [ttm.pM]-  We  take  Ip*  =  [am»PM]  as  the  integrated  output 
estimate. 

It  is  clear  that  Pm  -  ttm  =  Up’l  <  |Ipl,  where  1 1 1  is  the  width  of  the  interval 
I.  Thus  in  our  failure  model  we  have  in  general  a  way  of  narrowing  the 
output  estimate  Ip  to  Ip*.  However  if  the  number  of  wildly  fault)  sensors  are 
as  many  as  the  tamely  faulty  ones,  and  if  they  happen  to  cluster  somewhere 
else  on  Ip,then  it  is  possible  that  llp*l  =  |Ipl.  Thus  the  worst  case  for  Ip*  is  Ip.  The 
chances  that  wildly  faulty  sensors  mimic  the  clustering  behavior  of  tamely 
faulty  sensors  are  remote.  Also  if  the  number  of  sensors  is  very  small,  it  is 
possible  that  llp*l  =  |Ipl. 

For  example,  consider  the  case  of  three  input  sensor  estimates 
ll  =[2.4,3.21,  l2=[2.9,4],  l3=[3.6,5].  In  this  casc  Ip=[2.9,4].  Here  L,=[2.9,3.2]  and  L2=[3.6,4], 
but  they  both  have  the  same  reliability  .Hence  Ip*=Ip  here. 
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4.2  The  Algorithm 


We  now  present  the  algorithm  as  follows: 

Algorithm  : 

Input  :  Intervals  11,12,...,  In,  h 
Output:  Integrated  output  estimate, 
begin 

1. Take  all  (n-f)-intersections  of  the  intervals  to  yield  Intervals 

Li, . .  Lie,  each  of  which  is  an  (n-O-intersection  :  {  Lj  =  [^aj,  bjj  }; 

2.  For  each  i  (1  <  i  <  k) 

i.  Count  the  number  of  intervals  intersecting  each  of  the  intervals 
Ij  (1  <  j  <  n)  having  nonempty  intersection  with  Lj; 

ii.  Add  these  numbers  up  to  obtain  a  number  rj  {  rj  gives  the  sum 
of  the  number  of  intervals  intersecting  with  intervals  involved 

in  the  fomiation  of  the  Lj.  a  is  a  measure  of  the  reliability  of  Li ); 

3.  Choose  the  maximum  of  the  ri  (1  <  i  <  k)  and  call  it  r, 

4.  Let  m  =  min{  i  I  q  =  r  }  and  M  =  max{  i  I  ri  =  r  ); 

* 

5.  Assign  Ip  =  [am,  bM]  to  be  the  integrated  output  estimate; 

end. _ 

4,3  A  Comparison  of  Performance 

In  this  new  model,  we  find  that  when  the  number  of  sensors  is  very 
large,  by  taking  the  clustering  of  the  tamely  faulty  sensors  into  consideration, 
we  reduce  the  output  intervals  width  greatly,  as  compared  to  Marzullo's  [4] 
output  interval  estimate. 

The  Figure  3  below  illustrates  the  superior  performance  of  our  model 
clearly.  The  numbers  near  each  interval  estimate  gives  the  number  of 
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intervals  overlapping  with  it.  Here  n  =  13  and  f  (maximum  number  of  fault 
intervals  allowed)  =  10.  The  The  thick  lines  in  Figure  3  are  the  intervals  Lj 
with  the  numbers  on  them  indicating  their  reliabilities.  We  may  pick  either 
the  interval  with  higher  reliability  or  define  a  range  for  reliabilities  and  pick 
intervals  which  fall  in  these  limits.  The  thick  line  of  the  bottom  indicates  the 
output  interval  estimate  for  this  case  in  Marzullo's  model. 
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Output  estimate  — 
in  Marzullo's  model 
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Output  estimates^ 
from  our  model  with  maximum  reliabilities 


Figure  3.  A  comparison  of  performance 


Table  1.  Popularities  of  Intervals 


Interval 

II 

12 

l3 

B 

B 

4 

I7 

l8 

I9 

IlO 

Ill 

I12 

Il3 

B 

2 

B 

B 

3 

2 

4 

B 

B 

2 

2 

4 

3 

The  essential  gain  in  the  model  proposed  above  is  that  the 
(n-f)-intersections  are  assigned  weights  which  are  in  the  above  mentioned 
sents  reliability  estimates  of  these  intersections.  We  may  now  impose  any 
convenient  rule  for  choosing  these  segments  according  to  their  reliabilities. 
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In  table  1,  we  have  each  interval  assigned  a  numbering  which  is  its 
popularity. 

The  (n  -  f)  or  more  intersections  (i.e.  3  to  4-intersections  here)  which 
form  the  output  have  reliabilities  8,  12,  15,  15,  10. 

Thus,  the  advantage  here  is  that  we  have  the  intersection  weighted  to 
help  us  judiciously  choose  the  output  intervals.  We  may  employ  any 
convenient  rule  depending  upon  our  faith  in  the  tameness  of  the  faults  to 
pick  these  intervals  and  enclose  them  by  connected  interval.  For  instance,  we 
may  choose  only  those  intervals  with  maximum  reliability  (in  this  case,  the 
intervals  with  reliability  15)  and  enclose  them  by  a  connected  interval.  It  is 
clear  that  the  worst  possible  width  for  the  final  output  interval  estimate  is  the 
smallest  interval  containing  all  intersections  irrespective  of  their  weights. 


4.3  Narrowing  width  of  Intervals  in  the  case  of  non-uniform 

di^triMions. 

So  far  we  have  assumed  that  a  sensor  does  not  give  any  information 
about  the  probability  distribution  of  the  correct  physical  value  over  the 
interval  (abstract  sensor  estimate)  that  represents  it.  In  this  case,  we  assume 
that  each  value  in  this  interval  is  equally  likely,  i.e.,  we  assume  a  uniform 
distribution  of  the  physical  value  over  each  interval  estimate.  However,  in 
the  event  of  the  sensor  giving  a  unimodal  probability  distribution  of  physical 
value  over  its  interval  estimate,  say,  for  example,  a  normal  distribution,  then 
we  may  use  this  additional  information  to  narrow  the  interval  widths 
beforehand  by  restoring  to  confidence  interval  estimates  of  the  physical  value. 
The  resulting  subset  of  this  estimation  is  a  connected  interval  whose  width  is 
less  than  the  original  interval.  More  precisely,  if  S  is  a  sensor  with  its  interval 
estimate  I  =  [  a,  b  ]  and  p(x)  is  a  unimodal  probability  distribution  on  I  of  the 
physical  value  measured  by  the  sensor,  and  further  if  (1  -  a)  is  a  high 
probability,  we  need  to  compute  members  L(a),  U(a)  such  that 

P[  X  G  [  [  L(a),  U(a)  ]]  =  1  -  a 
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U(a) 

=  j  p(x)dx 
L(a) 

where  [  L(a),  U(a)  ]  is  called  100(1  -  a)%  confidence  interval  for  the  physical 
value  X,  and  (1  -  a)  is  called  the  level  of  confidence  associated  with  the 
interval. 

Now,  [  L(a),  U(a)  ]  —  [  a,  b  ]  if  a  ’^Owith  a  high  probability  of 
containing  the  physical  value  (if  S  is  not  faulty)  and  if  is  narrower  than  the 
original  interval  [  a,  b  ].  It  is  obvious  that  the  widths  of  the  L;  decrease  if  the 
width  of  one  or  more  of  the  input  intervals  decrease.  Thus  the  narrowing  of 
the  intervals  before  integration  when  possible  increases  the  accuracy  of  the 
output. 


5.0  CONCLUDING  REMARKS 

In  order  to  address  the  general  problem  of  fault-tolerant  sensor 
integration  for  a  large  class  of  sensors,  it  is  necessary  to  evolve  a  broad-based 
computational  framework  which  can  accommodate  a  wide  range  of  sensors 
and  a  variety  of  fault  tolerant  integration  techniques  depending  upon  the 
phenomenon  being  sensed  and  the  method  of  sensing.  We  intend  to  develop 
a  calculus  of  sensor  integration  by  regarding  the  sensor  estimates  as  subsets  of 
an  abstract  parameter  space  and  obtaining  functional  representations  of  the 
characteristics  of  these  estimates.  We  then  intend  to  obtain  rules  for 
combining  these  functions  to  get  functions  describing  the  characteristics  of  the 
output  according  to  the  kind  of  integration  that  is  required  to  be  performed. 
This  paper  is  a  preliminary  exercise  in  concert  with  this  effort.  We  have 
recast  the  fault-tolerant  integration  of  abstract  interval  estimates  a  la  Marzullo 
[4]  in  a  computational  framework,  and  considered  a  failure  model  wherein 
we  could  reduce  the  width  of  the  output  interval  estimate  significantly  in 
most  cases  where  the  number  of  sensors  involved  is  large. 
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